Estimating Bayesian Phylogenetic Information Content
نویسندگان
چکیده
Measuring the phylogenetic information content of data has a long history in systematics. Here we explore a Bayesian approach to information content estimation. The entropy of the posterior distribution compared with the entropy of the prior distribution provides a natural way to measure information content. If the data have no information relevant to ranking tree topologies beyond the information supplied by the prior, the posterior and prior will be identical. Information in data discourages consideration of some hypotheses allowed by the prior, resulting in a posterior distribution that is more concentrated (has lower entropy) than the prior. We focus on measuring information about tree topology using marginal posterior distributions of tree topologies. We show that both the accuracy and the computational efficiency of topological information content estimation improve with use of the conditional clade distribution, which also allows topological information content to be partitioned by clade. We explore two important applications of our method: providing a compelling definition of saturation and detecting conflict among data partitions that can negatively affect analyses of concatenated data. [Bayesian; concatenation; conditional clade distribution; entropy; information; phylogenetics; saturation.].
منابع مشابه
A Bayesian phylogenetic approach to estimating the stability
Supplementary data tml http://rspb.royalsocietypublishing.org/content/suppl/2010/08/27/rspb.2010.1595.DC1.h "Data Supplement" References http://rspb.royalsocietypublishing.org/content/278/1704/474.full.html#ref-list-1 This article cites 30 articles, 10 of which can be accessed free Subject collections (2289 articles) evolution (406 articles) cognition Articles on similar topics can be fou...
متن کاملEstimation of Products Final Price Using Bayesian Analysis Generalized Poisson Model and Artificial Neural Networks
Estimating the final price of products is of great importance. For manufacturing companies proposing a final price is only possible after the design process over. These companies propose an approximate initial price of the required products to the customers for which some of time and money is required. Here using the existing data of already designed transformers and utilizing the bayesian anal...
متن کاملPhylogenetic estimation of timescales using ancient DNA: the effects of temporal sampling scheme and uncertainty in sample ages.
In recent years, ancient DNA has increasingly been used for estimating molecular timescales, particularly in studies of substitution rates and demographic histories. Molecular clocks can be calibrated using temporal information from ancient DNA sequences. This information comes from the ages of the ancient samples, which can be estimated by radiocarbon dating the source material or by dating th...
متن کاملSupervised Learning and Bayesian Classification
This document discusses Bayesian classification in the context of supervised learning. Supervised learning is defined. An approach is described in which feature likelihooods are estimated from data, and then classification is done by computing class posteriors given features using Bayes rule. Estimating of feature likelihoods, independence of features, quantization of features, and information ...
متن کاملEstimating E-Bayesian of Parameters of two parameter Exponential Distribution
In this study, E-Bayesian of parameters of two parameter exponential distribution under squared error loss function is obtained. The estimated and the efficiency of the proposed method has been compared with Bayesian estimator using Monte Carlo simulation.
متن کامل